Probability Curves Showing Poisson's 
Exponential Summation 

By GEORGE A. CAMPBELL 

IN many important practical operations the constant probability 
of an event happening in a single trial is extremely small, but the 
number of trials is so large that the event may actually occur a suffi- 
cient number of times to become a matter of importance. The curves 
of Figs. 1 and 2 show the probability P of such an event happening at 
least c times in a number of trials for which the average number of 
occurrences is a. The probability range shown is from 0.000001 to 
0.999999 and the average extends from to 15 in Fig. 1 and to 200 
in Fig. 2. An open scale is obtained at both ends, even when the 
probability approaches to within one part in a million of the limits 
and 1, by employing an ordinate scale corresponding to the normal 
probability integral. 

In the practical use of these curves the first question which arises 
is — What number of trials is necessary to make the curves applicable? 
In practice an infinite number of trials, which is the case for which 
the curves are drawn, can never be attained; and if we had absolutely 
no knowledge of the relation between the probabilities for an infinite 
number and a finite number of trials, the curves would have a theo- 
retical interest only. We do, however, know in a general way when a 
finite number of trials approximates to the limiting case; the more 
complete and preci.se our knowledge on this point, the more generally 
useful the curves will become. Without attempting to go into the 
question exhaustively, which would require most careful analysis, a 
general answer will be found to the question as to the number of trials 
required bv plotting the simple functions (a'c)'\ Uc — a — \), and 
hc(c-a-\). 

The characteristic of all probability curves when ;/ is either finite 
or infinite, is shown by Fig. 3, where P(c,n,a) denotes the probability 
of an event happening at least c times in ;; trials when the average 
number of occurrences is a. Any curve P(c,n,a) is contained be- 
tween the ordinates at a=0 and a = n and is asymptotic to these 
ordinates; it cuts P= 1 /i between o = C— 1 and c — 0.3. Thus as n 
decreases from infinity to c, the central portion of the c curve changes 
but little, but the curve is confined to the narrowing band to the 
left of a = n and becomes steeper. On reducing n to c— 1 the c curve 
disappears entirely, since r cases cannot occur; the number of trials 
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n is an integer which cannot be less than the average a or the number 
of occurrences c. 

Fig. 3, making use of Fig. 2 as a background, shows for c = \, 10, 
50, 100, 150 and 200, the curves of the point binomial with n = oo and 
n = c, as heavy full and dashed lines, respectively. Each pair of 
curves, with the exception of the first, crosses in the neighborhood of 
P = $, and, except near this crossing, all of the intermediate curves of 
each family of c curves lie between these extreme curves. The rela- 
tive change in the probability P or 1— P, when these probabilities 
are small, due to reducing n to this lower limit c, for the c curve, is 
great, but the relative increase in the average a is only moderate over 
the greater part of the range covered by Fig. 3. The extreme rela- 
tive change in the average a is shown by dots placed on each of the 
Poisson exponential curves, each dot being located at the point where 
the extreme relative increase in the average is ±.25, ±.50, or ±.75. 
The relative increment in the average ranges, for Fig. 3, from a de- 
crease of 93 per cent at P = . 999999 on c = \ to an increase of 97 per 
cent at P=. 000001 on e = 9 and 10, but the greater part of the field 
is included between the beaded curves for ±50 per cent. Having 
thus obtained, by examining Fig. 3, a general idea of the relative and 
absolute numerical magnitudes of the extreme changes to which the 
probability curves are subject, we are in a better position to make 
practical use of the curves of Figs. 4 and 5 for the small initial shift 
in the curves occurring when the number of trials is finite but still 
large compared with c. 

The rate at which the probability curves start to shift, when the 
number of trials is decreased from infinity, is shown by Fig. 4, which 
gives the value of the first coefficient A in the expansion, in descending 
powers of n, for the relative increment in the average. In the upper 
part of the curves the shift is to the right and in the lower part of the 
curves it is to the left. The point at which the curve remains in- 
itially at rest is shown by the intersection of the c curve with the 
curve for A =0. Since A =35 is the largest arithmetical value occur- 
ring on Fig. 4 and « = 700 will make the first term of the series equal 
1/20, and the next term is then still smaller, it follows that Fig. 2 
redrawn for 700 trials would not show a difference of more than about 
5 per cent in any value of the average. For Fig. 1 the corresponding 
number of trials is 220; it may be shown by direct computation that 
n may even be reduced to the lower limit 1 with only a small 
percentage change Jjn the abscissas of the upper portion of the 
curve c = l. 
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Curves similar to Fig. 4 showing the exact number of trials pro- 
ducing a given relative or absolute shift in the average would be 
useful. Still another variation is shown by Fig. 5 where the curves 
give the first coefficient in the expansion, in descending powers of n, 
of the ratio of the increments in probability, due to a decrease in 
n and to unit increase in c. These curves therefore show the initial 
rate at which any c curve approaches the c+1 curve above it, if the 
scale of ordinates were made linear; below the curve .4=0 the initial 
shift is downward as indicated by the negative sign for the ^4's. If 
sets of curves corresponding to Figs. 1 and 2 were drawn for the 
number of trials w=400 and 2000, respectively, no curve would be 
shifted by as much as the original distance between the curves shown, 
since the maximum values on Fig. 5 up to o = 15 and 200 are 400 and 
10,000, respectively; Fig. 2 shows only every fifth curve; the second 
term of the series indicates that the initial maximum rate of shift 
is not maintained as n decreases at these points. 

The second question arising in connection with the use of the curves 
is their accuracy. Fig. 1 was drawn with the greatest care on a 
scale somewhat larger than that of the reproduction, and errors are 
believed to be only of the order of uncertainty of reading such curves 
with the unaided eye. Fig. 2 was drawn with less skill and shows 
larger deviations but it has proved accurate enough for ordinary 
applications. 2 

The third question which may arise is that of going beyond the 
curves either in range or in accuracy. 3 The exact calculated values 
employed in plotting the curves up to c= 101 are contained in Table II, 
every entry having been independently checked by two persons. 
The greater part of the table was calculated by means of a new formula 
which so expresses the average in terms of P and c as to readily give 
accurate results for the central range of P with large values of c, which 

1 Cf. Soper, H.E., The Numerical Evaluation of the Incomplete B-Function, 1921, 
p. 41, and Fisher, A., Mathematical Theory of Probabilities, 2nd Edition, 1922, p. 276. 

2 These claims for the accuracy of the curves of Figs. 1 and 2 have been confirmed 
by comparison with Pearson's Tables of the Incomplete T-Function, 1922, which 
has been received during the proof-reading of this paper. His tabulated function 
/ («, p) is, in the not ation of the present paper, the probability P corresponding 
to the average a = «V/>-f-l and the number of occurrences c=p-\-\. 

3 When c is not greater than 51, Pearson's tables may be employed. If the prob- 
ability is assigned, as in many practical engineering problems, finding the corre- 
sponding average from the tables requires interpolation. Formula (1) of the present 

fiaper gives the average directly, that is, it gives the inverse incomplete gamma 
unction. The following formula gives c in terms of a: 



= a\\- 



to- i +i(*»+2)a-i- r io»-|-2*)a-5+ 
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is the domain in which the ordinary formulas are not convenient for 
calculation. This is formula (1) below which involved transforming 
the normal probability integral to fit the skew probability summation 
of Poisson's exponential binomial limit. The reason for thinking 
that this transformation would prove useful is made clear by noting 
that in Figs. 1 and 2 the curves become more and more uniformly 
spaced with increasing values of the average a and thus the prob- 
ability approximates more and more closely to the normal probability 
integral, since this is the scale employed for the ordinates. The results 
of the mathematical work are summed up in the following formula: 

For Poisson's exponential binomial limit the average a is expressed 
as a function of the probability P of at least c occurrences by the infinite 
series 

a = c^Q n c-*», (1) 

n = o 

where the coefficients Q„ are functions of the argument t corresponding to 
the probability P expressed in the form of the normal probability integral, 

p =^rJr^ (2) 

twelve of these coefficients are given in the following table: 



Table I Coefficients in Formula (1) for the Average 

n Q„ 

1 

1 / 

2 (/ 2 -l)/3 

3 (/ 3 -7/)/2 2 3 2 

4 (-3/ 4 -7/ 2 - r -16)/2 1 3 4 5 

5 (9* 5 + 256* 3 - 433*)/2 5 3 6 5 

6 ( 12/ 6 - 243/ 4 - 923* 2 + 1 ,472)/2 3 3 6 5 1 7 

7 { -3,753/ 7 -4,353/ 5 + 289,5 17/ 3 + 289,71 It )/2 7 3 s 5 2 7 

8 (270* 8 +4,614/ 6 -9,513/ 4 - 104.989* 2 + 35,968)/2 4 3 9 5 2 7 

9 (-5,139/ 9 -547,848/ 7 -2,742,210/ 5 + 7,016,224* 3 + 37,501,325/)/ 

2113.0527 
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10 (-364,176* 10 -|-6,208,146* 8 +125,735,778/ 6 -r-303,753,831/ 4 

- 672,186,949/ 2 -2,432,820,224)/2 7 3 13 5 3 7 1 ll 

11 (199,112,985/ n + l,885,396,761* 9 -31,857,434,154/ 7 
-287,542,736,226/ 5 -556,030,221,167/ 3 +487,855,454,729/)/ 

2 13 3 14 5 3 7 2 11 

For any given value of P the corresponding value of t in (2) can be 
found from tables of the probability integral. The value of a for 
this value of P and for any value of c can then be determined by (1). 
In this way values of a were calculated for every integral value of c 
from 1 to 101 and for eleven particular values of P: 0.000001, 0.0001, 
0.01, 0.1, 0.25, 0.5, 0.75, 0.9, 0.99, 0.9999, 0.999999. These results 
are presented in Table II. The numerical values of the coefficients 
Qi to Qt, corresponding to the particular values of P used in Table II, 
are given in Table VII. 

From the information given in Table II, two sets of curves were 
drawn, Figs. 1 and 2, the first for each integral value of c in the range 
a = to a = 15 and P = 0.000001. to P = 0.999999, and the second for 
every fifth integral value of c in the range a = to a = 200 and the same 
range of P. From these curves any one of the variables (P, c, a) 
may be found corresponding to assigned values of the other two, sub- 
ject to the practical condition that c is to be an integer. 

Proof 

The well-known expressions for the summation of Poisson's ex- 
ponential binomial limit are: 

_ a c e- a a c + i e~ a a c + 2 e- a 
- P__ 7T + (c+1)! + (c+2)! + •'• 



c-\ 
s = o 

-m/y ,e " da - (3) 
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The series expansion (1) is determined by equating the integrands 
of (2) and (3), 

— 7 -^a c - x e- a da=— 7 =e-**dt, (4) 

r(e) V2tt 

and solving for positive values of a with the condition that t= — o° 
when a = 0. 

Let c=^, a= ~fiQ' Q = eL > 






(5) 



Substituting these values (5) in equation (4), 

L' = be*, (6) 

where V is written for dL/dt. 

30 00 00 oo 

Let L-^Lsb*, M=^M s b*, R=^R s b , Q=^Q&,(7) 

S = 5 = S = i = 

where the coefficients are polynomials in t (constants in the case of 
the series for M). Upon substituting these series expansions for the 
functions in the last equality of (5) and equating coefficients of like 
powers of b, we obtain 

=<2«,-Lo-l, 

=Qi-Li, 

Ro = Q 2 -L 2 -%t 2 +M , 

Ri-Qt-U+Mi, 

Rt-Qi-U+M* 



Rn = Q«+2-L n+2 + M n , (« = 1, 2, 3 . . . ). (8) 

From (5) we obtain Q = e Lo , and then L +l = e Lo , the only real 
solution of which is L o = 0, and therefore, Q = l. 
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Utilizing these initial values we obtain 
Qi=L lt 

Q 2 = L 2 +\UQ U 
Q 3 = L 3 +%L 2 Q l +$L l Q 2 , 



Q„=^ !L ^L n - s Q s , (n = l, 2, 3. ..). (9) 

s=o 

From (6) we obtain L[ = e Ro , and from that £,;=e*.M -**+*•. 

Since L\ is a polynomial in t, and Mo a constant, we must have L{ =»1, 
L\= =M, Mo = 0, that is, Li = t, and hence Qi = t. 

Then L' 2 = R U 



n — i 

iUi-y^^-^+i, (»-l, 2, 3 . . .)• (10) 

5 — 

The next set of coefficients can now be deduced, as follows: 

Ui-iXiGi+11-ift+Mi, 

Z«2=-Rii 

Qt=L a +iLiQi, 

Li=\ L&+\ L x L 2 +\ L?<2i+^i, 

U = t, 

Qi=t, 

U = L 2 t+lt 3 +M u 

But Mi is a constant, and L2 is a polynomial in /. Let 

Lo = Cit 2 +Cit-\-c , 
it being evident that L 2 is of the second degree. Then 

U_ = 2c 2 t+ci. 
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Substituting and equating coefficients of like powers of / 

Hence L 2 = (-t z -2)/G, R x =-t/%, \ 

Mi = 0, (2-. = (/ 2 -l)/3. f 



(11) 



Starting with these initial values equations (8)-(10) are sufficient 
to determine as many coefficients in the expansions (7) as are re- 
quired. In order to demonstrate this, assume that all the coefficients 
up to and including L k , Mk-i, Rk-i* Qk have been determined. It can 
then be shown that the next coefficient in each expansion can be 
obtained from these data, as follows: 

For n = k-\-2, equation (9) can be written 

k 

ft+2 = I*+2+^T2^+^+ / f f,,p Lk+i-sQs+jj-yQk+yt, (12) 

s=2 

where Q k +i, Q k +>- At+i> L k+ o are the unknown quantities. For 
n = k and n = k — l, equation (8) assumes the forms 

Rk = Qk+2-L k+i +M k , (13) 

and Rk-i = Qk+i-L k +i+Mk-i, (14) 

respectively, where all the quantities are unknown except R k -\ and 
Mk-\. For n = k, equation (10) can be written in the form 

*-i , 
L k+l =R k + ]T -j^Rk-sLUi- (15) 

j=i 

where L k+i and R k are the unknown quantities. Substituting in (12) 
the value of (Q k+2 — L k+2 ) found from (13), and then substituting 
the value of R k found from (15) and the value of Q k+l from (14), 

LUi=M k +^^L k+l t+-£~(R k ~ l +L k+x -M k -. l )t 
k k-i 

This is a linear differential equation in L k+1 as a function of /, all the 
coefficients being known functions of / with the exception of M k 
which is an undetermined numerical constant. By a suitable choice 
of the constant M k , (16) may be solved for L k+l as a polynomial in 
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t of the (fe+l)st degree. Rk may then be determined by (15) and 
Qk+i by (14). From these results, the next set of coefficients may be 
found, and so on. The values of the coefficients for k = 2 (L2, Mi, Ri, 
Qz) have been found, and equations (8)-(10) are valid for the par- 
ticular values of n utilized in the above method. Hence the next set 
of coefficients {L 3 , M 2 , R2, Qn) may be found, and in the same way, as 
many more as are desired. The detailed work of the first step is 
indicated below: 

Substituting k = 2 in (16), 

U = M 2 +%L 3 t+KRi+L 3 -M 1 )t+hL2Q2+hRiU. (17) 

Substituting in (17) the values known from (11), 

U = L 3 t+(-t i -2t 2 +2)/36+M 2 . (18) 

Let L 3 be a polynomial of the form (A 3 t 3 +A2t 2 -\-Ait-\-Ao) and sub- 
stitute in (18). Upon equating coefficients of like powers of t, we 
find that ^ 3 = l/36, A 2 = 0, .4 1 = 5/36, A = 0, and M 2 = l/l2. i? 3 
is then obtained by substituting these values in (15) and Q 3 from (14). 
The results are as follows: 

Z,3 = (/ 3 -5/)/36, i? 2 = (/ 2 -5)/36, 



(19) 
M 2 =l/12, Q 3 = (*»- 70/36. ' 

The actual work of computing these coefficients has been performed 
up to and including £ = 11 (In, Mw, Rio, Qu). These results are pre- 
sented in the attached tables: Q n in I, L„ in III, L,', in IV, R n in V, 
and M„ in VI. From this information the next coefficient in the 
series (1), Qu, can be computed by the method outlined above. 

It maybe pointed out in conclusion that the expansion of ^presented 
in Table VI is the asymptotic series obtained in Stirling's expansion 
of T(c), as is to be expected from equations (5). This in itself con- 
stitutes a partial check upon the determination of the coefficients. 

Additional Properties of the Curves 

At the probability P = 0.5, the difference (c— a) = 1/3, approxi- 
mately. 4 Discrepancies are so small as not to be positively dis- 

1 This recalls the approximate rule that the median lines one-third of the distance 
from the mean towards the mode. (Yule, Theory of Statistics, 1911, p. 121.) But 
in the Poisson exponential the median never lies between the mean and the mode; 
the median occurs at the first integer above or below the mean, whichever integer 
corresponds to the c curve cutting P = 0.5 next below the mean, while the mode is 
always at the first integer less than the mean. For the range of cases having a given 
mode, however, the mean and the median are, on the average, greater than the mode 
by J and approximately J, respectively; thus the median must line one-third of 
the distance from the mean towards the mode in the case of the corresponding hetero- 
geneous samplings. 
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cernible on Fig. 1, but Table II gives for c=l, 2, 3 . . . 100, the differ- 
ences (c-a) =0.3069, 0.3217, 0.3259, ... 0.3331, which differ but 
little from 0.3333 . . . , which is approached more and more closely 
for large values of c. 

At P = 0.5 and large values of c the derivative along the c curve is 
dP/da = l/\/2irc, as found by differentiating (3), substituting 
a = c— 1/3 and Stirling's expression for the gamma function. Thus, 
for large values of c the slope of the curve at P = 0.5 decreases nu- 
merically as the square root of c increases. For large values of c the 
curves are approximately straight over the wide range of proba- 
bility shown in the figures. This, in connection with the additional 
fact that the standard deviation \/npq is always equal toy/a for the 
Poisson exponential, is an alternative way of arriving at the ex- 
pression for the derivative given above. 

I am indebted to Miss Edith Clarke for extending the series of 
formula (1) to seven terms, for making all of the original computa- 
tions and for drawing Fig. 1, and to Miss Sallie E. Pero for extending 
the formula to its present eleven terms, and for checking all of the 
preceding work; the single error which she found occurred in the 
seventh term of the expansion where it was without effect on the final 
numerical results. Finally, the work was entirely rechecked, with- 
out discovering additional errors, by Mr. Ronald M. Foster, who also 
put the mathematical work into its present form, pointed out the 
asymptotic nature of the expansion and compared the overlapping 
numerical results with those obtained by direct summation by Miss 
Lucy Whitaker 5 and more recently by Mr. E. C. Molina, as well as 
with his earlier table. 6 

5 Tables for Statisticians and Biometricians, 1914, Table LII. 

6 Computation Formula for the Probability of an Event Happening at Least C 
Times in N Trials, American Mathematical Monthly, XX, June, 1913. p. 193. 
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TABLE III 
n L„ 



1 / 

2 {-t 2 -2)/2 1 Z 

3 (/ 3 + 5*)/2 2 3 2 

4 (-6/ 4 -59* 2 -58)/2 2 3 4 5 

5 (9/ 5 +232/ 3 +5990/2 5 3 5 5 

6 (24* 6 -45/ 4 -817/ 2 +592)/2 4 3 6 5 1 7 

7 ( - 3.753J 7 - 44,853^ - 149.683/ 3 - 418,5830/2 7 3 8 5 2 7 

8 (540* 8 +12,396/ 6 +77,283* 4 +226,939* 2 +217 I 112)/2 5 3 9 5 2 7 

9 (-5,139/ 9 -416,952* 7 -4,411 I 314/ 5 -17 ( 022,320/ 3 -24 ) 039,6190/ 

211310527 

10 (-728,352/ 10 -2,418,858/ 8 +84 ) 239 ) 766* 6 +514,580,817* 4 

+428,031,517/ 2 -2,293,097,728)/2 8 3 13 5 3 7 1 ll 

11 (199,1 12,985/ u +4,293,113,877/ 9 +28,888,236,342/ 7 

+ 124,692,719,238/ 5 +654,335,303 ) 761* 3 +2,373,932 ) 511,173/)/ 

2 13 3 14 5 3 7 2 11 

TABLE IV 
n L' n 



1 1 

2 -t/3 

3 (3**+5)/2 2 3 s 

4 (-12^-59/)/2 l 3 4 5 

5 (45* 4 +696/ 2 +599)/2 5 3 8 5 

6 (72/ 5 -90/ 3 -817/)/2 3 3 6 5 1 7 

7 (-26,271/ 6 -224,265/ 4 -449,049/ 2 -418 1 583)/2 7 3 8 5 8 7 

8 (2,160/ 7 +37,188/ 5 +154,566/ 3 +226 ) 9390/ 24 3 9 5 2 7 

9 (-46,251/ 8 -2 ) 918,664/ 6 -22,056 ) 570/ 4 -51,066,960/ 2 

-24,039,619)/2 11 3 10 5 a 7 

10 (-3 l 641,760/ 9 -9,675,432^ 7 +252,719,298/ 5 +l ) 029 ) 161,634i 3 

+428,031,517/)/2 7 3 13 5 3 7 1 ll 

11 (2,190 ) 242,835* 10 +38,638 ) 024,893* 8 +202,217,654 ) 394/ 6 
+623,463,596,190/ 4 +l I 963 ) 005,911,283* 2 +2,373,932,511,173)/ 

2 13 3 14 5 3 7 2 11 



112 BELL SYSTEM TECHNICAL JOURNAL } 

TABLE V 

n R„ 



1 -//3 

2 (/ 2 + 5)/2 2 3 2 

3 (/ 3 -43/)/2 2 3 4 5 

4 (-21* 4 -49* 2 + 112)/2 4 3 5 5 

5 (45/ 6 +488/ 3 + 787/)/2 5 3 6 7 

6 ( - 1 ,056/ 6 - 32,103^ - 145.6392 2 - 150,452)/2 5 3 8 5 2 7 

7 (-2,727/ 7 +34,773/ 5 +500,803^4-l,282,1030/2 7 3 9 5 2 7 

8 (9,990/ 8 + 112,614/ 6 +62,577/ 4 - 1,193,539^- l,732 1 352)/2 8 3 10 5 a 7 

9 (-28,663,299/ 9 -723,162,744/ 7 -4,907,564,946* 6 

-14,409,H3,392/' , -22,453 ) 298,2910/2 1, 3 13 5 !, 7 1 ll 

10 (12,763,008/ ,0 +897,127,182* 8 +ll,273,606,766/ ,i 

+58,618,777,197/ 4 +161,552,157,577; 2 +172,910,387,072)/ 

2 9 3 14 5 8 7 2 11 





TABLE VI 


n 


M H 








1 





2 


1/12 


3 





4 





5 





6 


- 1/360 


-7 





s 





9 





10 


1/1260 



1 2 3 4 5 6 7 8 9 10 II 12 13 14 15 




5 6 7 

Values of a 

Fig. 1.— Probability Curve* Showing Poisson'i Exponential Summation P=l-A + — + — * j. "'"' \ . -. 
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Values of P 
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